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ABSTRACT 

Recent publications claim that there is no convincing evidence for measurements of 
the baryonic acoustic (BAO) feature in galaxy samples using cither monopole or radial 
information. Different claims seem contradictory: data is either not consistent with the 
BAO model or data is consistent with both the BAO model and featureless models 
without BAO. We investigate this point with a set of 216 realistic mock galaxy catalogs 
extracted from MICE7680, one of the largest volume dark matter simulation run to 
date, with a volume of 1300 cubical gigaparsecs. Our mocks cover similar volume, 
densities and bias as the real galaxies and provide 216 realizations of the Lambda 
or w = -1 Cold Dark Matter (wCDM) BAO model. We find that only 20% of the 
mocks show a statistically significant (3 sigma) preference for the true (input) wCDM 
BAO model as compared to a featureless (non-physical) model without BAO. Thus the 
volume of current galaxy samples is not yet large enough to claim that the BAO feature 
has been detected. Does this mean that we can not locate the BAO position? Using a 
simple (non optimal) algorithm we show that in 50% (100%) of the mocks we can find 
the BAO position within 5% (20%) of the true value. These two findings are not in 
contradiction: the former is about model selection, the later is about parameter fitting 
within a model. We conclude that current monopole and radial BAO measurements 
can be used as standard rulers if we assume wCDM type of models. 
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1 INTRODUCTION 

Primordial fluctuations generated acoustic waves in the 
early universe photon-baryon plasma. Those waves were 
frozen at decoupling, z ~ 1100, then baryon acoustic os- 
cillations (BAO) were imprinted in the cosmic microwave 
background (CMB) at the sound horizon scale, as a series 
of peaks in the power spectrum or a single peak in the 2- 
point correlation function (see eg Peebles and Yu, 1970 and 
Komatsu et al 2010 for the latest measurements by WMAP). 

BAO can also be seen at the present in matter power 
spectrum, and its position, tbao can be used as a stan- 
dard cosmological ruler. Measurements in the radial (red- 
shift direction), Az, can be used to estimate the Hubble 
rate as H(z) = cAz/tbao, while angular measurements, 
A9, can be used to estimate the angular diameter distance: 
Da(z) — tbao/AO. Baryon acoustic oscillations in the 
galaxy correlations of the Sloan Digital Sky Survey (SDSS) 
luminous red galaxy (LRG) sample have been used to con- 
strain cosmological parameters (eg Eisenstein et al 2005, 
Hutsi et al 2006, Sanchez et al 2009, Percival et al 2010, 



Reid et al 2010, Kazin et al 2010a and references therein). 
Different studies use different ways to extract the BAO sig- 
nal and quantify the significance of the measurements (see 
Sanchez et al 2008). For example, Eisenstein et al 2005 and 
Sanchez et al 2009 used the full shape of the 2-point cor- 
relation to ojCDM class of models and found constraints to 
the combination distance D v (z) = (D 2 A / H) 1 ^ to the galaxy 
sample mean redshift based on a global \ fitting, while Per- 
cival et al 2010 used a fit to the oscillatory components in 
the power spectrum to find constraints on D v (z). 

These previous analysis used the monopole component 
of the correlation function, where all pairs are averaged with 
independence of their orientation. Okumura et al 2008 did a 
separate analysis of pairs as a function of orientation but 
avoiding the radial direction. Gaztanaga, Cabre and Hui 
(2009, GCH hereafter) presented constraints to H(z) based 
on the radial correlation, which uses only those pairs aligned 
with the redshift direction. This reduces the number of ob- 
servational data but boost the contrast on the BAO peak 
because of redshift space distortions. At intermediate scales, 
lower than BAO, the correlation function becomes negative 
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in the line-of-sight direction, creating a better contrast in 
the BAO position, easier to detect than in real space. Also, 
non-linearities, magnification and bias can boost the peak 
(see GCH and Tian et al 2010 for further details). 

GCH presented two ways to analyze the BAO data: the 
peak and the shape method. In the peak method they find 
the location of the peak and use it as standard ruler to mea- 
sure H(z). In the shape method they use a x 2 fit to the full 
shape of the correlation and find the best shift in the dis- 
tance H(z)/Hq. The shape method was also used to test if 
the data was compatible with the shape of the correlation ex- 
pected in wCDM. They compare different classes of models: 
the standard BAO cjCDM model, a similar class of models 
without BAO (so called no- wiggle model in Eisenstein & Hu 
1998) and a model with zero correlation £ = 0. The no-BAO 
model has A\ 2 = 10 with respect to the best fitting ojCDM 
model while a model with £ = has A\ 2 = 4. Kazin et 
al (2010b) did an independent analysis of the SDSS catalog 
and found similar results for the correlation measurements 
and errors. In their interpretation they did not explore the 
parameter space of wCDM but conclude that there is no con- 
vincing evidence for radial BAO because the £ = model 
fit the data better than uCDM. They argue that there are 
no parameters in the f = model while for cjCDM sev- 
eral parameters where fitted in GCH. After including the 
penalty for adding parameters, they find that ojCDM is not 
significantly better than £ = 0. 

But a similar argument could be extended to the BAO 
monopole measurements. For example, if one fits a con- 
stant correlation to the LRG correlation function in Fig. 17 
of Sanchez etal (2009) to scales larger than 70 Mpc/h one 
finds that this model can not be distinguished from a ojCDM 
model with free parameters. The original Eisenstein etal 
(2005) results can also be well fitted with a power-law 
model 1 . Does this mean that the BAO feature has not been 
detected at all? These are important points to clarify as it 
is common practice to include BAO measurements when fit- 
ting cosmological models to provide evidence for dark energy 
models (eg Sanchez et al 2009; Komatsu et al 2010; Kazin 
et al 2010a; Gaztanaga, Miquel & Sanchez 2009). 

Other recent studies seem to reach a similar conclusion, 
that the BAO feature has not been detected, but using an 
argument that seems to go in the opposite direction. Rather 
than finding that data is too noisy and compatible with 
featureless models, they find that the data is not consistent 
with cjCDM (eg see Labini et al 2009, Labatie et al 2010). 
Also see Martinez et al 2009 for a study of peak detection 
using DR7 monopole. We will investigate this here to find, 
as in previous analysis (eg GCH, Sanchez et al 2009, Kazin 
et al 2010a) that data is in good agreement with cjCDM 
although we should stress that this statement will depend 
on the specific test we use. 

We will argue that there are two separate questions 
mixed up in the above line of argumentation: model selec- 
tion and parameter fitting. We will find that while current 
data can not be used to select ojCDM, one can still constrain 
the parameters of wCDM if this model is assumed. To show 
this, we will set out to address two main questions: 1) can 
we use current BAO data to favor ojCDM? In other words: 
is the volume of current data large enough to pass a null 
detection test to choose cjCDM over some other model? 2) 



can we constrain the parameters of the uCDM model, and 
in particular the BAO position with current data? 

We will investigate these points with a set of 216 mock 
galaxy catalogs extracted from MICE7680 (see Fosalba et al 
2008, Crocce et al 2009), one of the largest volume dark mat- 
ter simulation run to date. The mocks are made to match 
the SDSS LRG DR6 sample and should therefore provide a 
good representation of biased cjCDM realizations. We will 
use these mocks to explore the peak and the shape method 
applied to the monopole. We use the monopole here (rather 
than radial BAO) for several reasons: shape measurements 
have larger signal-to-noise, theoretical modeling of monopole 
is better understood (see GCH) and the monopole BAO has 
been more widely used to test cosmological models. Rather 
than comparing the wCDM with some add-hoc correlation 
(power-law, constant or some combination) we choose to fo- 
cus on comparing BAO and no-BAO models. This has the 
advantage of being a well defined procedure (quite standard 
in the literature) where we have the same number of param- 
eters in each case, which simplifies the interpretation of the 
statistical significance when comparing two different models 
with different number of parameters (eg see Liddle 2009). 

Throughout we assume a standard cosmological model, 
with fi M = 0.25,«a = 0.75, D. h = 0.044, n s = 0.95, cr 8 = 0.8 
and h = Ho/(100kms _1 Mpc _1 ) = 0.7. 



2 BAO IN GALAXY MOCKS 

Appendix A in Cabre & Gaztanaga 2009 (CG09 from now 
on) describes how our mocks were built and also how the 
correlation function is estimated. 1 We include both bias and 
redshift distortions in the mocks. We will focus here on the 
monopole correlation for halo z=0 mocks with a bias b ~ 2, 
similar to LRG galaxies. The correlation function for our 216 
mocks, its mean and errors are displayed in Fig[l] These 
mocks are realistic as they cover similar volume, densities 
and bias as the real LRG galaxies, but they have some limi- 
tations. In general, one needs first to explore the parameters 
in wCDM (and bias model) to get a good match to data. Our 
simulations have j3 = f(Q m )/b — 0.25 and z = 0, which are 
different from the values in real data /3 ~ 0.34 ± 0.03 and 
z = 0.35 (the difference in /3 comes from the difference in 
redshift, as bias is similar, see CG09). Depending on the test 
used, this could result in a poor fit of models to data. De- 
spite these limitations, we will find below a good fit of data 
to the mocks when we allow the amplitude to vary in the fit. 
This indicates that our mocks provide a good representation 
of the data, given the errors, at least for the questions we 
want to address here. 

In our analysis we will pretend that each mock is a 
realization of the real LRG data. Our mocks are close enough 
to the real data to provide a realistic representation of how 
much variation there is from one realization of real data to 
the other. Indeed the jack-knife (JK) errors (and covariance 
matrix) in the real data are similar to the JK errors in our 
mocks and to the ensemble variation from mock to mock. 
This was shown in CG09 and can also be seen in Fig[T]where 
we compare the ensemble variation in mocks (short dashed 
lines) to the JK errors in the DR6 SDSS LRG measurements 
from CG09 (note that we show DR6 to be consistent with 
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Figure X. Thin (black) lines show the correlation function £(r) 
(scaled by r 2 ) in each of our 216 mocks. Solid (green) line shows 
the BAO model (the mean of the mocks). Short dashed (green) 
lines encompass 1-sigma errorbars from the mean (these are mock 
to mock errors, the error of the mean would be 1 /\/216 better). 
Long dashed (blue) line shows the no-BAO model. The (red) er- 
rorbars correspond to the real LRG data shifted as shown in Eq^ 
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Figure 2. Histograms showing the distribution of differences in 
X 2 values for different LRG mocks. The correlation function in 
each mock is fitted with both the standard uCDM BAO correla- 
tion and with the no-BAO class of models. The difference between 
the two x 2 values in each mock is accumulated in the histogram. 
There are = 20 data bins in each fit, but only one parameter 
is fitted (the overall amplitude). The figure shows that only 20% 
of the cases show a significant preference at 3cr (ie Ax 2 < — 9) 
for the BAO model over the no-BAO model. The mean for mocks 
is Ax 2 = —5. A fit to the real LRG data gives also Ax 2 = —5, 
close to the maximum. 



the mocks, but similar results are found for DR7, see GCH). 
To compare to simulations we have scaled the LRG data as: 

i{r)^A[i{r) + K] (1) 

with A — 1.2 and K — —0.005. The value of A accounts 
for the differences between the simulation and LRG data in 
/3, growth and bias. The value of K represents a possible, 
but quite minor (0.25%), error (contamination or sampling 
fluctuation) in the overall mean density of the sample. This 
has little impact in the fit of models to data (covariance 
allows for a constant shift in the data) but improves the 
visual comparison in the figure (see Fig. 17 in Sanchez et al 
2009). As indicated by Fig[l]the mocks represent quite well 
the variation seen in the observational data. 

2.1 The shape method: null test 

We use two models to fit the correlation £(r): 1) the BAO 
model: it uses the mean of all the mocks in order to have 
a perfect BAO model (with bias, redshift space and non- 
linearities effects included). 2) the no-BAO model: a non- 
physical model that imitates well the broad band corre- 
lation but does not include a BAO peak. We use the no- 
wiggle power spectrum of Eisenstein& Hu (2001) with same 
cjCDM parameters as the simulation. Fig[T] compares the 
BAO (solid line) with the no-BAO model (long-dashed line). 
Our null test is: does the data prefer the BAO to the no- 
BAO model at 3-sigma confidence level (CL)? 

To simplify the analysis and interpretation, the only 
free parameter that we fit is the global amplitude A of the 
correlation, which includes a possible bias (as we are us- 
ing halos) and a constant redshift distortion boost (Kaiser 



1987). We use the correlation function £i( r j) measured in 
the i-th mock at separation rj to perform a x 2 fit an d find 
the best fit amplitude Ai for either BAO or no-BAO models 
(which are labeled generically as £m): 

x? = E K*fa) ^ Mm(ri)] c£ &(n0 - M m (r k )] (2) 

The indexes j and k run over the Nb = 20 bin separations, 
ie v — 19 degrees of freedom. Bins are linearly spaced with 
Ar = 5 Mpc/h between 30 and 130 Mpc/h (we find similar 
results in the range 20-150 Mpc/h). The covariance matrix 
Cjk is estimated from the mocks: 

1 216 

C » = 215 E l&fo) - [^( rfc ) ~ ( 3 ) 

i=l 

where f (rj) = ^ C»( r j) i s the mean value in bin j. 

The resulting distribution of values of x? f° r the BAO 
model peaks around xf — v — 19 and is quite broad (Ax 2 — 
■s/2u ~ 6, as expected). The no-BAO model peaks at larger 
values (xf 24) and is slightly broader (Ax 2 ^ 7.7). The 
real LRG data produces x 2 = 20 for the BAO model and 
X 2 = 25 for the no-BAO model, well within the values found 
for most of the mocks. Thus, given the large errorbars, the 
real data seems to match quite well our mocks, despite the 
differences in the modeled values of j3, bias and z mentioned 
above. 

In Fig[2]we plot the histogram of the differences between 
the Xi values in the fits to the BAO and no-BAO models 
for each mock. Negative values mean that the mock prefers 
the BAO model over no-BAO model. A difference at 3er CL 
between both models, ie Ax 2 < —9, only happens in the 
20% of cases (up to 30% when we explore other range of 
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Figure 3. Histogram showing the distribution of peak BAO po- 
sition measured in LRG mocks. This distribution has tbao = 
107.2 ± 8.8 Mpc/h as compared to r BAO = 107.5 Mpc/h in the 
mean model. The distribution is quite gaussian, as shown by the 
line crossing the histograms. The same measurement in real LRG 
DR6 data yields r B AO = 112 Mpc/h. 

scales). This means than in 80% of the cases one does not 
expect to be able to distinguish between the two models (at 
more than 3a CL). This result is not surprising. The mean 
difference in \ 2 between the BAO and no-BAO model is 
only A\ 2 — —5, which in comparable to the width of the 
X 2 distribution with 19 degrees of freedom. In other words, 
current errors are still too large to claim a BAO detection. 

2.2 The peak method: BAO position 

In the peak method, we assume that we live in a ojCDM 
universe and try to locate the BAO position. To keep things 
simple, here we locate the position of the peak by search- 
ing the maximum in the correlation function in the BAO 
scale, between 80-135Mpc/h (results are similar when we 
move around 70-150Mpc/h). The BAO feature is modified 
by the presence of the broad band (CDM) correlation func- 
tion, which can be modeled approximately by a power law. 
We fit a power law to each correlation function at small 
scales (10 - 70Mpc/h) and subtract the correlation function 
from the best power law before locating the peak. GCH use 
a very similar peak method but do not need to subtract the 
power-law because the correlation is quite flat (and close 
to zero) in the radial direction. Sanchez et al (2010) use a 
similar but more elaborated version, where they fit simulta- 
neously a power-law, a constant shift and a gaussian (BAO) 
peak. This would provide more accurate errors for the peak. 

Fig. [3] shows the distribution of recovered BAO posi- 
tions for individual mocks. This distribution is well approx- 
imated by a Gaussian (also shown in the figure). The mean 
BAO position is 107.2Mpc/h with a dispersion of 8.8Mpc/h, 
compared to mocks mean value position at 107.5Mpc/h 
(note that the resolution in the position of the peak is of 
5Mpc/h). The position of the peak can differ slightly (less 
than 2%) from the sound horizon scale at decoupling (see 
Sanchez, Baugh and Angulo 2008 and Sanchez et al 2010) 
depending on the cosmology, non-linearities, and other ef- 



fects. For WMAP parameters, very similar to MICE simu- 
lation, the sound horizon scale is at 107.3Mpc/h. 

We find similar results when we use a fixed power-law 
for all mocks, the one fitted for the mean correlation func- 
tion, or when using the best power-law for each mock, as one 
would do in real data. When we apply the same method to 
the real DR6 data we find tbao = 112Mpc/h, well within 
the bulk of our mocks. 

We have also tried the method to locate the BAO posi- 
tion propposed by Kazin et al (2010a). The mocks (or data) 
are fitted using a x 2 likelihood (including covariance) to a 
BAO model which consists in the mean of the mocks, £m(r), 
shifted by two free parameters: the amplitude A and a scale 
shift a, ie A£ m (ar). We find a very similar histogram to that 
in Fig|3]but with smaller errorbar: 6% instead of 8%. Kazin 
et al (2010a) further reduced this error to 3 — 4% by using 
only the mocks which have a clear BAO as in the DR6 data. 
We will obviusly get smaller errors by removing such outliers 
in our mocks but this later step involves more assumptions 
than just the existance of a peak. It not only assumes that 
we live in ojCDM, but selects in a subjective way (a poste- 
riori) within a subset of realizations. Also note that in this 
method we are using a priori knowledge of the shape of the 
input model to locate the peak. In the peak method, used 
in Fig. 3, we do not need to make such assumption and so 
we think this makes a stronger case for the point we want 
to demostrate, even when the error is larger. 

It is more robust and self-consistent to locate the BAO 
and error using the full shape of £(r) and a larger family of 
cosmological models, eg as shown in Sanchez et al (2009), 
avoiding any dependence on a particular cosmology in the 
algorithm to locate the peak. The point demonstrated here is 
that the BAO position is imprinted in the mocks despite the 
fact that they do not pass a null detection test. A comparison 
between methods is left for future analysis. 



3 CONCLUSION 

The first question we set out to address was if the volume of 
current BAO data is large enough to pass a null detection 
test for ojCDM. The answer to this question seems negative. 
We have shown in Fig. 2 that the distribution of x 2 differ- 
ences is quite broad and one could find mocks for which the 
null test is passed or failed. In fact 80% of the mocks have 
Ax 2 > —9, which indicates no statistically significant (at 
3-sigma CL) preference for the true BAO input model as 
compared to the featureless no-BAO family. Current SDSS 
(DR6-DR7) data seems to lie close to the peak of this distri- 
bution, Ax 2 — —5, but according to Fig.2 this does not pro- 
vide convincing evidence for the BAO model. As expected, 
the DR3 results in Eisenstein et al 2005 (about half of the 
DR6 volume) is even less significant: Ax 2 — —1.1. When we 
compare the BAO model to a power-law fit (with 2 param- 
eters) we find x\ao ~ X^ower-law - 0-4. 1 

Our mocks have slightly different values of /3 and z than 
the DR6 data (see Fig.l) and we wonder if this could af- 
fect the above conclusion. The important point to notice is 
that the BAO and no-BAO model also have a similar dif- 
ference of Ax 2 (ie — —5) when we compare to the DR6 
data (using the same bins and covariance as in the mocks). 
Models with other cosmological parameters within the un- 
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certainties of cjCDM also produce similar A\ 2 ■ But a value 
of A\ 2 — —5 is comparable to the width of a \ 2 distribution 
with 19 degrees of freedom (which is A\ 2 — 6). This is why 
the result is not significant. Our conclusion is quite robust 
and mostly relays in the size of the errors and covariance 
between bins. The covariance estimate is consistent in the 
data (ie from Jack-knife subsamples), in our mocks and in 
other mocks produced by several groups (eg Eisenstein et 
al 2005, CG2009, Kazin et al 2010a). We would need \/L8 
times smaller errors, ie 1.8 times more data or some optimal 
weighting (Hamaus et al. 2010, Cai etal 2010), than DR6 
(so that A\ 2 increases from 5 to 9) to be able to claim a 3 
sigma BAO detection in the monopole. 

For the radial BAO analysis, GCH reach similar con- 
clusions. They find that the difference A\ 2 — —10 in DR6 
for 20 degrees of freedom (5Mpc/h radial bins within 40- 
140 Mpc/h) when comparing BAO (plus magnification) and 
no-BAO models (without magnification the difference is 
A\ 2 — —6). This seems more significant than the monopole, 
probably because the radial BAO peak is boosted by redshift 
space distortions and magnification. 

Does this mean that the BAO position can't be mea- 
sured? If we assume the cjCDM model, we can locate the 
BAO position to better than 8% of the true value, as illus- 
trated in Fig.3. We show that in 50% (100%) of the mocks 
we can find the BAO position within 5% (20%) of the true 
value. This error is an upper bound as we have not tried 
to optimize the method to locate the peak. We have com- 
pared the mocks with the real data and found no evidence 
for deviations away from the cjCDM. None of the 216 cjCDM 
realizations is identical to the measurements (or in fact to 
each other), but observations produce values that lie well 
within the histograms in Fig. 2 and Fig.3 for the two simple 
but generic tests that we have explored here. 

Lessons learned in this study can be applied to the 
monopole BAO analysis (eg Eisenstein et al 2005, Sanchez 
et al 2009, Percival et al 2010, Kazin et al 2010a) and the 
radial BAO in GCH (or the BAO in the 3-point function 
by Gaztanaga et al 2009). Kazin et al (2010b) have argued 
that because uCDM does not fit the radial BAO data signif- 
icantly better than a model with £ = (null test), the H(z) 
measurements presented by CGH based on the location of 
the radial BAO peak can not be regarded as a detection. 
We have shown here that his argument is not necessarily 
correct. If we apply such argument to the monopole BAO 
measurements previously cited we would conclude that we 
can not locate the peak position in current data because ac- 
cording to Fig. 2 there is no significant BAO detection. But 
we have shown here that we can locate the BAO position 
with reasonable accuracy even with data that fails the null 
BAO detection test. A similar analysis was done with Monte 
Carlo mocks in GCH for the radial BAO position. Even if 
the shape method gives low significance, the peak method 
can still be used to detect the position of the peak. 

Tian et al 2010 reaches similar conclusions for the radial 
BAO peak using different simulations. They use a wavelet 
technique to detect the peak and asses the significance of the 
detection, splitting SDSS into slices in various rotations. 

Current BAO measurements can not yet be used to se- 
lect wCDM, but they can be used to locate the BAO position 
(or other cosmological parameters) if one assumes wCDM or 



models which produce similar clustering (and errors) to the 
ones in wCDM. Q 
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